A generic approach for OCR performance evaluation
نویسنده
چکیده
For different document automation operations it is always needed to have an OCR evaluation phase to select the most interesting OCRs for the document class studied. The evaluation should indicate the defects and drawbacks of each OCR and allow to determine the required heuristics to combine these OCRs in order to obtain the highest performances in production: the lowest reject rate for a predefined confusion rate ( in general 1/10000). The evaluation should be done automatically and completely integrated in a more global OCR platform.
منابع مشابه
Performance Evaluation of Two Arabic OCR Products
Numerous Optical Character Recognition (OCR) companies claim that their products have near-perfect recognition accuracy (close to 99.9%). In practice, however, these accuracy rates are rarely achieved. Most systems break down when the input document images are highly degraded, such as scanned images of carbon-copy documents, documents printed on low-quality paper, and documents that are n-th ge...
متن کاملInformation retrieval for OCR documents: a content-based probabilistic correction model
The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most information retrieval techniques rely heavily on word matching between documents and queries. In this paper, we propose a general content-based correction model that can work on top of an existing OCR correction tool to “boost...
متن کاملA Content-based Probabilistic Correction Model for OCR Document Retrieval
The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most information retrieval techniques rely heavily on word matching between documents and queries. In this paper, we propose a general content-based correction model that can work on top of an existing OCR correction tool to “boost...
متن کاملA Model of Authors’ Generic Competence of EAP Research Articles: A Qualitative Meta-Synthesis Approach
Genre analysis as an area of great concern in recent decades, involves the observation of linguistic features used by a determined discourse community. The research article (RA) is one of the most widely researched genres in academic writing which is realized through some rhetorical moves and discursive steps to achieve a communicative purpose. This study aimed at proposing a model of generic p...
متن کاملEffect generic and non-generic feedback on Motor Learning basketball free throw in Children
Non-generic feedback refers to a specific event and that task performance is the reason to the acquisition of skills and implies that performance is malleable, while generic feedback implies that task performance reflects an inherent ability. The Goal of this study was to determine the generic and non-generic feedback effects on children’s motor learning basketball free throw. This research was...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001